173 research outputs found

    FASTCUDA: Open Source FPGA Accelerator & Hardware-Software Codesign Toolset for CUDA Kernels

    Get PDF
    Using FPGAs as hardware accelerators that communicate with a central CPU is becoming a common practice in the embedded design world but there is no standard methodology and toolset to facilitate this path yet. On the other hand, languages such as CUDA and OpenCL provide standard development environments for Graphical Processing Unit (GPU) programming. FASTCUDA is a platform that provides the necessary software toolset, hardware architecture, and design methodology to efficiently adapt the CUDA approach into a new FPGA design flow. With FASTCUDA, the CUDA kernels of a CUDA-based application are partitioned into two groups with minimal user intervention: those that are compiled and executed in parallel software, and those that are synthesized and implemented in hardware. A modern low power FPGA can provide the processing power (via numerous embedded micro-CPUs) and the logic capacity for both the software and hardware implementations of the CUDA kernels. This paper describes the system requirements and the architectural decisions behind the FASTCUDA approach

    Density of States of the lattice Schwinger model

    Get PDF

    Sqrt{shat}_{min} resurrected

    Full text link
    We discuss the use of the variable sqrt{shat}_{min}, which has been proposed in order to measure the hard scale of a multi parton final state event using inclusive quantities only, on a SUSY data sample for a 14 TeV LHC. In its original version, where this variable was proposed on calorimeter level, the direct correlation to the hard scattering scale does not survive when effects from soft physics are taken into account. We here show that when using reconstructed objects instead of calorimeter energy and momenta as input, we manage to actually recover this correlation for the parameter point considered here. We furthermore discuss the effect of including W + jets and t tbar+jets background in our analysis and the use of sqrt{shat}_{min} for the suppression of SM induced background in new physics searches.Comment: 23 pages, 9 figures; v2: 1 figure, several subsections and references as well as new author affiliation added. Corresponds to published versio

    Queue Management in Network Processors

    Get PDF
    Abstract: -One of the main bottlenecks when designing a network processing system is very often its memory subsystem. This is mainly due to the state-of-the-art network links operating at very high speeds and to the fact that in order to support advanced Quality of Service (QoS), a large number of independent queues is desirable. In this paper we analyze the performance bottlenecks of various data memory managers integrated in typical Network Processing Units (NPUs). We expose the performance limitations of software implementations utilizing the RISC processing cores typically found in most NPU architectures and we identify the requirements for hardware assisted memory management in order to achieve wire-speed operation at gigabit per second rates. Furthermore, we describe the architecture and performance of a hardware memory manager that fulfills those requirements. This memory manager, although it is implemented in a reconfigurable technology, it can provide up to 6.2Gbps of aggregate throughput, while handling 32K independent queues

    Forward Jets and Energy Flow in Hadronic Collisions

    Full text link
    We observe that at the Large Hadron Collider, using forward + central detectors, it becomes possible for the first time to carry out calorimetric measurements of the transverse energy flow due to "minijets" accompanying production of two jets separated by a large rapidity interval. We present parton-shower calculations of energy flow observables in a high-energy factorized Monte Carlo framework, designed to take into account QCD logarithmic corrections both in the large rapidity interval and in the hard transverse momentum. Considering events with a forward and a central jet, we examine the energy flow in the interjet region and in the region away from the jets. We discuss the role of these observables to analyze multiple parton collision effects.Comment: 9 pages, 5 figures. Version2: added results on azimuthal distributions and more discussion of energy flow definition using jet clusterin

    FASTER: Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration

    Get PDF
    The FASTER (Facilitating Analysis and Synthesis Technologies for Effective Reconfiguration) EU FP7 project, aims to ease the design and implementation of dynamically changing hardware systems. Our motivation stems from the promise reconfigurable systems hold for achieving high performance and extending product functionality and lifetime via the addition of new features that operate at hardware speed. However, designing a changing hardware system is both challenging and time-consuming. FASTER facilitates the use of reconfigurable technology by providing a complete methodology enabling designers to easily specify, analyze, implement and verify applications on platforms with general-purpose processors and acceleration modules implemented in the latest reconfigurable technology. Our tool-chain supports both coarse- and fine-grain FPGA reconfiguration, while during execution a flexible run-time system manages the reconfigurable resources. We target three applications from different domains. We explore the way each application benefits from reconfiguration, and then we asses them and the FASTER tools, in terms of performance, area consumption and accuracy of analysis

    Vitamin-V: Virtual Environment and Tool-boxing for Trustworthy Development of RISC-V based Cloud Services

    Get PDF
    Vitamin-V is a 2023-2025 Horizon Europe project that aims to develop a complete RISC-V open-source software stack for cloud services with comparable performance to the cloud-dominant x86 counterpart and a powerful virtual execution environment for software development, validation, verification, and test that considers the relevant RISC-V ISA extensions for cloud deployment

    RECO level \sqrt{s}_{min} and subsystem \sqrt{s}_{min}: improved global inclusive variables for measuring the new physics mass scale in missing energy events at hadron colliders

    Get PDF
    The variable \sqrt{s}_{min} was originally proposed in arXiv:0812.1042 as a model-independent, global and fully inclusive measure of the new physics mass scale in missing energy events at hadron colliders. In the original incarnation of \sqrt{s}_{min}, however, the connection to the new physics mass scale was blurred by the effects of the underlying event, most notably initial state radiation and multiple parton interactions. In this paper we advertize two improved variants of the \sqrt{s}_{min} variable, which overcome this problem. First we show that by evaluating the \sqrt{s}_{min} variable at the RECO level, in terms of the reconstructed objects in the event, the effects from the underlying event are significantly diminished and the nice correlation between the peak in the \sqrt{s}_{min}^{(reco)} distribution and the new physics mass scale is restored. Secondly, the underlying event problem can be avoided altogether when the \sqrt{s}_{min} concept is applied to a subsystem of the event which does not involve any QCD jets. We supply an analytic formula for the resulting subsystem \sqrt{s}_{min}^{(sub)} variable and show that its peak exhibits the usual correlation with the mass scale of the particles produced in the subsystem. Finally, we contrast \sqrt{s}_{min} to other popular inclusive variables such as H_T, M_{Tgen} and M_{TTgen}. We illustrate our discussion with several examples from supersymmetry, and with dilepton events from top quark pair production.Comment: 41 pages, 26 figure
    corecore